Recoverable Distributed Shared Memory System with Reduced Stable Logging
نویسندگان
چکیده
This paper presents a novel logging scheme for recov-erable distributed shared memory systems. In the previous schemes, the stable logging is performed whenever a remote data value is accessed. However, in the proposed scheme, the data values are logged on the volatile storage of the writer process and only the access information between the dependent processes is logged stably. Also, to reduce the logging frequency, the access information is collectively logged either at the writer process or at the dependent processes. The performance of the proposed scheme is analyzed through extensive simulation study and the new logging scheme shows superior performance in various system environments.
منابع مشابه
An efficient causal logging scheme for recoverable distributed shared memory systems
This paper presents a causal logging scheme for the lazy release consistent distributed shared memory systems. Causal logging is a very attractive approach to provide the fault tolerance for the distributed systems, since it eliminates the need of stable logging. However, since inter-process dependency must causally be transferred with the normal messages, the excessive message overhead has bee...
متن کاملReducing Interprocessor Dependence in Recoverable Distributed Shared Memory
Checkpointing techniques in parallel systems use dependency tracking and/or message logging to ensure that a system rolls back to a consistent state. Traditional dependency tracking in distributed shared memory (DSM) systems is expensive because of high communication frequency. In this paper we show that, if designed correctly, a DSM system only needs to consider dependencies due to the transfe...
متن کاملUsing Logging and Asynchronous Checkpointing to Implement Recoverable Distributed Shared Memory
Distributed shared memory provides a useful paradigm for developing distributed applications. As the number of processors in the system and running time of distributed applications increase, the likelihood of processor failure increases. A method of recovering processes running in a distributed shared memory environment which minimizes lost work and the cost of recovery is desirable so that lon...
متن کاملAn E cient Logging and Recovery Scheme for LazyRelease Consistent Distributed Shared
Checkpointing and logging are widely used techniques to provide fault tolerance for the distributed systems. However, logging imposes too much overhead on the processing to be a practi-val solution. In this paper, we propose a low-overhead logging scheme for the distributed shared memory system based on the lazy release consistency model. Unlike the previous schemes in which the logging is perf...
متن کاملAn Improved Logging and Checkpointing Scheme for Recoverable Distributed Shared Memory
The distributed shared memory(DSM) system transforms an existing network of workstations to a powerful shared-memory parallel computer which could deliver superior price/performance. However, with more workstations engaged in the system and longer execution time, the probability of faults increases which could render the system useless. Several checkpointing and logging schemes have been propos...
متن کامل